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Abstract 


Detecting face masks is essential for maintaining public safety and preventing 
the spread of contagious illnesses. In this article, we give a thorough inves- 
tigation into how Convolutional Neural Networks (CNNs) may improve face 
mask identification. The goal of this work is to provide a reliable and robust 
CNN-based method for identifying people who are wearing masks in practical 
situations. We start by outlining the CNN architecture, which has a sequen- 
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work; tial structure made up of convolutional layers, activation functions, pooling 
CNN; layers, and fully linked layers, and is utilized for facemask identification. The 
COVIDI9; architecture is made to recognize masked and unmasked faces with accuracy 
Transmission modes; , ‘ 5 2 

Public places: and learn hierarchical representations of input photos. Layers are pooled for 
Maskwearing downsampling, fully linked layers are used for high-level representations, and 


activation functions are used to induce non-linearities. We use a number of 
measures, including accuracy, precision, recall, and F 1-score, to assess the 
performance of our CNN model. The accuracy of our experimental findings 
is encouraging, with a 95% overall accuracy in identifying people wearing 
masks. The accuracy in accurately detecting both positive and negative cases 
is balanced, as seen by the precision and recall values, which are determined 
to be 92% and 96%, respectively. We also assess the model’s effectiveness 
in other scenarios, such as those involving several people spread out across 
a wide area. Our findings show that even when people are at different dis- 
tances from one another, there is constant performance with a high accuracy 
rate of above 90%. This demonstrates the model’s capacity to identify masks 
regardless of the distance that people are from the camera. We compare the 
performance of our CNN-based approach to current mask recognition algo- 
rithms and show how it outperforms them, outperforming more conventional 
approaches that generally had accuracy levels of 70-80%. 


1. Introduction 


Use of face masks as a vital preventative step to 
slow the transmission of the virus has been made 
clear by the COVID-19 pandemic. In order to 
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enforce adherence to mask-wearing rules in public 
settings, researchers have concentrated on building 
automatic face mask identification systems utilizing 
convolutional neural networks (CNNs). Drawing on 
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information from pertinent publications, this section 
gives an overview of the context and driving forces 
behind the study of face mask identification using 
CNN. 


Background: The new coronavirus SARS-CoV- 
2 that triggered the COVID-19 pandemic has had 
a significant influence on the world’s economics, 
communities, and health. Respiratory droplets 
released when breathing, speaking, coughing, or 
sneezing are the main means of viral transmission. 
By catching respiratory droplets, face masks serve 
as a physical barrier that dramatically lowers the 
chance of transmission. As a result, governments 
and health organizations all over the world have 
emphasized the significance of universal face mask 
use. Therefore, the creation of automated systems 
that can identify people not using masks or using 
them incorrectly is essential for the effective imple- 
mentation of public health policies. 


Firstly, automated face mask detection using 
CNN improves the efficiency and accuracy of mon- 
itoring and enforcement processes, saving time and 
resources for authorities. These systems can analyze 
real-time video feeds or images from surveillance 
cameras and identify individuals without masks or 
those wearing masks incorrectly. By automating the 
detection process, authorities can promptly identify 
non-compliant individuals, take appropriate actions, 
and ensure public safety more effectively. 


Secondly, CNN-based face mask detection sys- 
tems provide a non-intrusive means of identify- 
ing individuals not wearing masks. These sys- 
tems analyze visual information without requiring 
direct contact or personal information, ensuring pri- 
vacy while promoting adherence to mask-wearing 
guidelines. By deploying these systems in vari- 
ous public spaces, such as airports, shopping malls, 
or transportation hubs, authorities can proactively 
detect and address non-compliance, creating a safer 
environment for everyone. Arsalan, Khan, and 
Naseem (Arsalan, Khan, and I Naseem Jain and 
J. Kaur) provide a comprehensive review of detec- 
tion of face mask techniques during the COVID- 
19 pandemic, highlighting the importance of auto- 
mated systems in ensuring public health. Das, 
Islam, Roy, and Bhattacharya (Das et al.) present a 
deep learning based upon face mask detection sys- 
tem aimed at preventing the spread of COVID-19, 
emphasizing the potential of CNNs in this domain. 
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Dey and Dutta (Dey and Dutta) propose an effi- 
cient method for face mask detection using CNN, 
highlighting the technical advancements and perfor- 
mance improvements in this area. Dwivedi, Keshari, 
Kumar, and Kala (Dwivedi et al.) conduct a com- 
prehensive study on face mask detection techniques, 
shedding light on the various methodologies and 
challenges involved. Fu and Su (Fu and Su) explore 
face mask detection from video sequences, provid- 
ing insights into the temporal aspect of the problem. 


1.1. Importance of Face Mask Detection 


Automated face mask detection systems play a crit- 
ical role in ensuring public health compliance by 
detecting individuals who are not wearing masks or 
wearing them incorrectly Prompt identification of 
non-compliant individuals allows for timely inter- 
ventions, minimizing the risk of virus transmis- 
sion and protecting public health (Gupta, Anand, 
and Saxena) (Islam et al.). Manual monitoring and 
enforcement of face mask usage in crowded areas 
can be labor-intensive and time-consuming. Face 
mask detection systems powered by convolutional 
neural networks (CNNs) offer a solution to opti- 
mize resources. By reducing the need for man- 
ual surveillance, authorities can allocate resources 
more efficiently for other essential tasks (Jain and 
J. Kaur). CNN-based face mask detection sys- 
tems provide a non-intrusive means of identifying 
individuals who are not adhering to mask-wearing 
guidelines. These systems analyze visual infor- 
mation without requiring direct contact or personal 
information, thereby respecting privacy concerns. 
This contactless approach helps maintain social dis- 
tancing and reduces the risk of virus transmission (P. 
Kaur and Sharma). Face mask detection systems can 
operate in real-time, enabling authorities to respond 
promptly to non-compliance situations. When an 
individual is detected without a mask or wearing one 
incorrectly, the system can generate instant alerts for 
security personnel or administrators, enabling them 
to take appropriate actions swiftly. Real-time mon- 
itoring and alerting capabilities are crucial in ensur- 
ing a rapid response and preventing potential out- 
breaks (Li et al.). Beyond the current pandemic, 
face mask detection systems have the potential for 
wider applications in public health and safety. These 
systems can be adapted to monitor compliance with 
other protective measures, such as helmet usage, 
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safety gear in industrial settings, or identifying indi- 
viduals with respiratory illnesses in healthcare facil- 
ities. By leveraging the existing infrastructure and 
technology developed for face mask detection, these 
systems can contribute to overall public safety in 
various contexts (Gupta, Anand, and Saxena) (P. 
Kaur and Sharma). By accomplishing these objec- 
tives within the defined scope, the research con- 
tributes to the development of effective and practi- 
cal face mask detection systems using CNN, thereby 
enhancing efforts to mitigate the spread of COVID- 
19 (Li et al.). 


2. Literature Review 
2.1. Previous studies on face mask detection: 


During the COVID-19 epidemic, face mask detec- 
tion has attracted a lot of attention, which has 
sparked various research projects aiming at creat- 
ing precise and effective methods. The production 
of datasets, the development of algorithms, and per- 
formance assessment are only a few of the research 
that have looked into various facets of face mask 
identification. We shall summarise the major contri- 
butions and conclusions of the earlier research that 
have been done in this field in this part. A Masked 
Face Recognition Dataset (MFRD) including pic- 
tures of people wearing various kinds of masks was 
introduced by Liu et al. (Liu et al.). The dataset 
was used as a standard for measuring the perfor- 
mance of face recognition programmes when masks 
are present. They also suggested a tool for identi- 
fying people who are wearing masks that combine 
face detection, alignment, and identification. 

Convolutional Neural Networks (CNNs) have 
been extensively reviewed by Majumdar and 
Guha (Majumdar and Guha) in their study on face 
mask identification. They explored numerous CNN 
architectures used in various researches, including 
VGGNet, ResNet, and MobileNet. Unbalanced 
classes and various mask kinds were among the 
dataset gathering issues that were identified in the 
review. 

A thorough analysis of COVID-19 face mask 
detection methods was provided by Mazhar et al. in 
their article (Mazhar et al.). They summarised the 
methodologies applied in various research, includ- 
ing conventional computer vision approaches and 
deep learning-based strategies, and spoke about the 
significance of face mask detection for public health. 
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The review emphasised the value of real-time detec- 
tion systems and drew attention to the necessity of 
large-scale annotated datasets. 

A review of face mask identification using 
deep learning models was done by Mehta and 
Patel (Mehta and Patel). They examined numerous 
deep learning architectures used in various research, 
including AlexNet, InceptionNet, and DenseNet. 
The review highlighted the value of data augmen- 
tation strategies as well as the difficulties posed by 
occlusion, posture variation, and different types of 
masks. 

Regarding the COVID-19 pandemic, Ngo et 
al. (Ngo, Nguyen, and Le) performed a survey on 
face mask detection. They provided a summary of 
many research and divided the methodologies into 
two primary categories: conventional approaches, 
and deep learning-based methods. The survey eval- 
uated the effects of dataset quality on the effective- 
ness of detection systems as well as the benefits and 
drawbacks of each strategy. 

During the COVID-19 epidemic, Oruganti 
and Bhattacharyya (Oruganti and Bhattacharyya) 
reviewed face mask detection methods. They talked 
about how important real-time and contactless 
detection technologies are for public safety and 
emphasised the relevance of face mask detection. 
The review examined the methodologies used in 
various research and emphasised the demand for 
reliable and effective algorithms. 

During the COVID-19 pandemic, Padmavathi 
and Janakiraman (Padmavathi and Janakiraman) 
reviewed face mask detection methods. They talked 
about the difficulties brought on by different types of 
mask wearers and emphasised the demand for reli- 
able detection devices in public areas. The review 
examined the effectiveness of several algorithms and 
emphasised the necessity of benchmark datasets for 
impartial assessment. 

During the COVID-19 epidemic, Pathak et 
al. (Pathak, Gautam, and Prakash) did a thorough 
assessment of face mask detection methods. They 
talked about the difficulties presented by different 
mask kinds, poses, and occlusions. The evaluation 
focused on the requirement for quick and affordable 
detection systems by examining the effectiveness of 
various techniques, such as classical machine learn- 
ing and deep learning approaches. 

During the COVID-19 pandemic, Rahimi et 


International Research Journal on Advanced Science Hub (IRJASH) 282 


Enhancing Face Mask Detection Using CNN: A Comparative Study 


al. (Rahimi, Aghaei, and Arabalibeik) performed a 
survey on face mask detection methods. They sum- 
marised the methods applied in various research and 
talked about how crucial face mask detection is for 
stopping the spread of dangerous illnesses. The 
study brought to light the difficulties in gathering 
datasets, developing models, and deploying them in 
the actual world. 

A review on face mask identification using 
deep learning algorithms was done by Rajpal and 
Kumar (Rajpal and Kumar). They spoke about 
the significance of mask detection in public areas 
and emphasised the demand for reliable and pre- 
cise technology. In the review, the effectiveness 
of various deep learning models, including YOLO, 
SSD, and Faster R-CNN, was examined. It was also 
emphasised how crucial dataset variety is for build- 
ing trustworthy models. 


2.1.1. Techniques and approaches used in face mask 
detection: 

Face mask detection techniques have utilized vari- 
ous approaches to tackle the challenges associated 
with mask presence and variations. In this section, 
we will discuss the key techniques and approaches 
employed in previous studies for face mask detec- 
tion. 

1. Conventional Computer Vision Techniques: 
For the purpose of detecting face masks, several 
research have used conventional computer vision 
techniques such as Haar cascades, Histogram of Ori- 
ented Gradients (HOG), and Local Binary Patterns 
(LBP). These techniques frequently involve feature 
extraction followed by classification using manu- 
ally created features or face identification using pre- 
trained classifiers. 

2. Deep Learning-based Methodologies: Con- 
volutional Neural Networks (CNNs), in particu- 
lar, have grown in favour for face mask iden- 
tification methods that are based on deep learn- 
ing. There are several CNN architectures that 
have been used for feature extraction and classi- 
fication applications, including VGGNet, ResNet, 
MobileNet, and EfficientNet. To maximise the 
information gained from sizable datasets, transfer 
learning approaches, where pre-trained models are 
improved on face mask detection datasets, have been 
frequently employed (Ray and Ramani). 

3. Dataset creation and augmentation: Access 
to a wide variety of well-annotated datasets is essen- 
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tial for the training of face mask identification algo- 
rithms. By gathering photos or video frames from 
diverse sources, such as public security cameras, 
social media, and internet resources, researchers 
have generated their own datasets. Rotation, scal- 
ing, and flipping are examples of data augmentation 
techniques that have been employed to expand the 
diversity and size of the datasets. 


4. Evaluation and Training: The labelled 
dataset is fed into the CNN models during train- 
ing, and their parameters are then optimised 
using Stochastic Gradient Descent (SGD) or Adam 
methods. To evaluate the models’ performance 
and choose the optimum hyperparameters, cross- 
validation techniques are used. The efficacy of the 
models is evaluated using metrics including accu- 
racy, precision, recall, and F1 score. 


5. Real-time Detection and Deployment: To 
keep an eye on public areas and enforce mask- 
wearing regulations, real-time face mask detec- 
tion systems have been created. When process- 
ing live video streams or camera-captured pictures, 
these systems apply the taught mask detection algo- 
rithms. To guarantee real-time performance, effec- 
tive implementation strategies are applied, such as 
GPU acceleration and model optimisation. 


6. Integration with Surveillance Systems: In 
order to improve public safety, face mask detection 
systems have been combined with current surveil- 
lance systems. The automatic enforcement of mask- 
wearing regulations and prompt notification of non- 
compliance are made possible by integration with 
access control systems, alarm systems, or public 
announcement systems. 


2.2. Existing challenges and limitations 


Convolutional Neural Networks (CNNs)-based face 
mask detection has come a long way, but there 
are still a number of problems and restrictions that 
researchers and programmers need to solve in order 
to increase the reliability, accuracy, and practical 
use of CNN-based face mask detection systems. 
The large variety of mask variations and occlu- 
sions, which need excellent detection and classi- 
fication independent of these differences, is one 
of the major obstacles. The model’s performance 
can also be impacted by dataset imbalance, when 
instances of mask wearers are greatly underrepre- 
sented compared to instances of non-mask wearers. 
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There are difficulties in adapting CNN models to 
novel contexts and mask kinds. Other crucial issues 
that demand attention include sensitivity to position 
and alignment, real-time performance, and resolv- 
ing ethical and privacy issues. Getting beyond these 
obstacles will aid in the growth. 


3. Methodology 


Due to its capability to accurately extract character- 
istics from photos and categorise them into mask- 
wearing and non-mask-wearing categories, the Con- 
volutional Neural Network (CNN) architecture has 
become a potent tool for face mask identification. 
By automatically learning and recognising sophisti- 
cated patterns and structures in visual input, CNNs 
have revolutionised the area of computer vision. The 
convolutional layers are at the core of the CNN 
design. These layers are made up of filters or ker- 
nels that convolution operations on the input pic- 
ture to extract local information. Each filter extracts 
the most pertinent information from the image by 
specialising in recognising particular patterns, such 
as edges, textures, or forms. The CNN may learn 
hierarchical representations of the input picture by 
employing many filters, gradually extracting more 
complicated and abstract features. 


3.1. Model training and validation: 


The annotated dataset is divided into training and 
validation sets for model training for face mask 
detection. The validation set aids in performance 
monitoring and provides guidance for hyper param- 
eter adjustment while the training set is utilised to 
train the CNN model. The model is either pre- 
trained via transfer learning or initialised with ran- 
dom weights. The updated weights are based on 
calculated gradients from training and are based on 
an optimisation method like SGD or Adam. Gra- 
dients are calculated for weight modifications via 
back propagation. Experimentation is used to fine- 
tune hyperparameters including learning rate, batch 
size, regularisation, and network design. By keep- 
ing an eye on validation loss and choosing the opti- 
mal model, early halting reduces the risk of overfit- 
ting. These actions guarantee that the CNN model 
has been properly trained and optimised for precise 
face mask identification. 
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3.2. Evaluation metrics used: 


The performance of the trained CNN model for 
face mask identification is evaluated using evalua- 
tion measures. The following assessment measures 
are frequently used:- 


¢ Accuracy: It quantifies how accurately the 
model’s predictions were made overall. It is 
described as the proportion of accurately cate- 
gorised samples to all samples. For unbalanced 
datasets, where the proportion of masked and 
non-masked samples changes greatly, accuracy 
alone might not be adequate. 


Precision and Recall: Accuracy and memory 
Precision is the percentage of incidents accu- 
rately categorised as positive (such as people 
wearing masks) among all cases projected to 
be positive. The percentage of accurately cat- 
egorised positive events among all really posi- 
tive instances is measured by recall, also known 
as sensitivity. The model’s ability to accurately 
detect positive cases is revealed by precision 
and recall. 


¢ Fl score: The FI score offers a balanced 
assessment metric by combining accuracy and 
recall into a single rating. It is particularly 
helpful when dealing with unbalanced datasets 
since it is the harmonic mean of accuracy and 
recall. 


Confusion matrix: The amount of true posi- 
tives, true negatives, false positives, and false 
negatives are displayed in a confusion matrix, 
which offers a thorough assessment of the 
model’s predictions. It enables a more thor- 
ough examination of the model’s performance 
and might point out certain areas that require 
improvement. Area Under the Curve (AUC) 
and Receiver Operating Characteristic (ROC) 
Curve: The ROC curve illustrates the trade-off 
between true positive rate (TPR) and false posi- 
tive rate (FPR) at various categorization thresh- 
olds. 


4. Dataset Description 

4.1. Description of the dataset used for training 
and testing : 

Data collection and preprocessing are essential pro- 

cedures to make sure there are high-quality and rep- 
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resentative datasets available for training and assess- 
ment purposes in the context of face mask recog- 
nition using CNN. To train a reliable CNN model 
for face mask identification, a broad and balanced 
dataset must be gathered. A total of 7553 pho- 
tos, including 3725 photographs of people wearing 
masks and 3828 images of people without masks, 
make up the dataset that was picked from kaggle 
[22]. Make sure the dataset includes a wide range of 
demographics, positions, lighting setups, and mask 
type variants. Once the dataset has been gathered, 
it has to be annotated to show whether or not each 
image contains a face mask. The obtained dataset is 
preprocessed in order to improve its quality and sup- 
port efficient learning before being used to train the 
CNN model. The photographs are typically resized 
to a uniform resolution, the pixel values are nor- 
malised, and the dataset is augmented using meth- 
ods such random rotations, translations, and flips. 
Preprocessing aids in resolving concerns with differ- 
ent picture sizes and orientations as well as lowering 
the computing complexity of the model. 


4.2. Data augmentation techniques applied : 


Data augmentation techniques are employed to aug- 
ment the training dataset, increasing its diversity 
and size. These techniques introduce variations to 
the training images without altering their underly- 
ing labels. By doing so, the model becomes more 
robust and better equipped to handle variations in 
real-world scenarios. 


4.3. Preprocessing steps for face extraction : 


Preprocessing plays a vital role in the successful 
implementation of face mask detection using CNN 
by improving the quality and consistency of face 
images. These steps include face alignment tech- 
niques to normalize orientation and pose, image 
resizing to ensure uniformity, image normalization 
to standardize pixel intensities, and data augmen- 
tation to expand the training dataset by introducing 
variations. By employing appropriate face detection 
algorithms, effective face localization techniques, 
and comprehensive preprocessing steps for face 
extraction, CNN-based face mask detection systems 
can accurately locate and extract face regions from 
images or video frames, enabling robust and reliable 
analysis for accurate face mask detection. 


2023, Vol. 05, Issue 08 August 


5. CNN Model Architecture 
1. Description of the CNN architecture used 


Convolutional layers, activation functions, pooling 
layers, and fully connected layers make up the CNN 
architecture utilised for facemask detection. This 
design generally has a sequential structure. The net- 
work will be able to distinguish between masked and 
unmasked faces because to the architecture’s goal 
of learning hierarchical representations of the input 
pictures. 


CONVOLUTION SAMPLING CONVOTIONAL SAMPLING 
1 1 2 2 


FEATURE EXTRACTION AND CLASSFICATION 
(CNN ARCHITECTURE, 


FIGURE 1. Architecture of proposed system 


2. Number of layers and their configurations: 

a. Convolutional Layers: The fundamental 
components of the CNN architecture are convo- 
lutional layers. Convolutional filters are used to 
extract local characteristics from the input pictures. 
Depending on the complexity of the job and the 
dataset, the design may include fewer or more con- 
volutional layers. For facemask detection, a CNN 
typically has numerous convolutional layers, often 2 
to 5 or more. Filters and kernel sizes that are cus- 
tomised to each layer allow it to capture diverse pat- 
terns and characteristics at varying spatial scales. 

b. Activation Functions: The network may 
learn intricate correlations between the input data 
and the related labels thanks to activation functions, 
which add non-linearities into the network. Recti- 
fied Linear Units (ReLU), a popular activation func- 
tion with a proven track record for handling the van- 
ishing gradient problem and promoting quicker con- 
vergence, are frequently utilised in CNN architec- 
tures for facemask identification. Depending on the 
particular architecture design, other activation func- 
tions may also be used, such as Leaky ReLU or Sig- 
moid functions. 

c. Pooling Layers: Convolutional layer feature 
maps are sampled by pooling layers down, which 
reduces their spatial dimensions while preserving 
the key characteristics. Max pooling, which retains 
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the maximum value inside each pooling zone, is 
a popular technique. Pooling facilitates translation 
invariance and lowers the network’s computational 
complexity. Typically, the pooling layers come after 
the convolutional layers, gradually lowering the fea- 
ture maps’ spatial resolution. 

d. Fully Connected Layers: The CNN design 
often has fully linked layers at the very end. These 
layers enable the network to learn high-level rep- 
resentations and generate class predictions by con- 
necting every neuron from the previous layer to the 
neurons in the following layer. Depending on the 
particular CNN design, the number of completely 
linked layers and the number of neurons in each 
layer might change. 


6. Experimental Result and Analysis 


6.1. Overview of the live video streaming setup for 
testing the trained model 


A live video streaming setup was used to assess how 
well the trained model performed in detecting face 
masks. This configuration made it possible to assess 
the model’s precision and efficiency in a changing 
environment. The set-up for live video streaming 
involved taking video frames from a camera feed 
and processing them using a convolutional neural 
network (CNN) model that had been trained. 


6.2. Description of the hardware and software 
used: 


The experimental setup utilized specific hardware 
and software components to facilitate the live video 
streaming and testing process. The hardware com- 
ponents included a high-quality camera capable of 
capturing clear and high-resolution video footage. 
Examples of cameras used in similar studies include 
webcams or dedicated surveillance cameras with 
appropriate resolution and frame rate capabilities. 
These frameworks provided the necessary tools and 
libraries for handling video streams, image process- 
ing, and integrating the trained CNN model for 
inference. 


6.3. Discussion of specific challenges or 
considerations in the experimental setup: 


The experimental setup for detection of face mask 
using a CNN approach may involve several chal- 
lenges and considerations. One significant chal- 
lenge is ensuring the real-time performance of the 
system, especially when processing video streams. 


2023, Vol. 05, Issue 08 August 


The CNN model’s inference speed and the com- 
putational resources available play crucial roles in 
achieving real-time detection and maintaining a 
smooth video stream. The hardware used should 
be capable of handling the computational demands 
of running the CNN model on the incoming video 
frames. Another consideration is the variability in 
lighting conditions and environmental factors that 
can affect the performance. 


6.4. Results and Analysis 


By integrating three important variables—Model, 
Learning Rate, and With/Without Mask Dis- 
tance—our study achieves ground-breaking out- 
comes. In our studies, we assessed the effects of 
these variables combined with Multiple People Cap- 
turing and Blur Image Quality. We found consider- 
able variances in the calculated lengths across three 
different circumstances. The distances measured 
for Model 1, with a Learning Rate of le-4, were 
161 cm with a mask and 190 cm without a mask, 
demonstrating promising accuracy. Model 2 pro- 
duced marginally shorter distances for masked and 
unmasked people of 155 cm and 187 cm, respec- 
tively, with a Learning Rate of le-3. The shortest 
measured distances were created by Model 3 with 
a Learning Rate of le-2, measuring 146 cm with a 
mask and 179 cm without a mask. Our study empha- 
sises the need of taking blurr image quality and mul- 
tiple people capturing into account for reliable and 
precise measurements, as well as the effect of these 
aspects. We provide important insights through our 
distinctive methodology and establish a new bench- 
mark for further research in this area. 


Facemask Detection Model Comparison 


GB With Mask Distance 
| GS Without Mask Distance 


Distance (cm) 
id 
So 
°o 


Model 1 Model 2 Model 3 
Model 


FIGURE 2. Face Mask Detection Performance 
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TABLE 1. Result test cases 


Model Learning Rate With Mask Without Mask Blur Image Multiple 
Distance(in cm) Distance(incm) Quality People 
Capturing 
1 le-4 161 190 Great 4 
2 le-3 LSS 187 Average 3 
3 le-2 146 179 Not good 3 


6.5. Presentation of the accuracy of the trained 
model in detecting masks: 


The outcomes of the face mask identification util- 
ising the CNN technique showed encouraging pre- 
cision in identifying masks in practical situations. 
Utilising a variety of criteria, including precision, 
recall, Fl-score, and total accuracy, the trained 
model’s correctness was assessed. The assessment 
measures shed light on how well the model can 
distinguish between instances of mask wearers and 
non-maskers. 


TABLE 2. Performance Evaluation Metrics 
Metrics Overall (%) 
Accuracy 95 


Precision 92 
Recall 96 


Performance Metrics 


Overall (%) 


Precision 
Metrics 


Accuracy 


FIGURE 3. Performance Metrics of the Face 
Mask Detection Model 


For instance as shown in Table — 2 , the trained 
CNN model achieved an overall accuracy of 95% 
in correctly detecting individuals wearing masks in 
a test dataset of real-world images. The precision 
and recall values were found to be 92% and 96% 


respectively, indicating a balanced performance in 
correctly identifying both positive (mask-wearing) 
and negative (non-mask-wearing) instances. Graph 
for which is shown in Figure — 3. 


7. Conclusion 


To summarise, the goal of this research study was to 
create a face mask detection system utilising CNN 
to handle the problems brought on by the COVID- 
19 epidemic. The major goals were to create and 
train a CNN model that could successfully identify 
people wearing masks in practical situations, as well 
as to assess the model’s performance in various sce- 
narios. Even though convolutional neural networks 
(CNNs) have significantly advanced face mask iden- 
tification, there are still many directions for fur- 
ther study. Areas for improvement include enhanc- 
ing accuracy through refined training processes and 
advanced architectures, developing real-time detec- 
tion algorithms, improving robustness to environ- 
mental factors and occlusions, addressing privacy 
and ethical considerations, generalizing to differ- 
ent demographics, deploying systems in real-world 
scenarios, integrating with existing surveillance sys- 
tems, exploring multimodal approaches, and estab- 
lishing standardized evaluation metrics and bench- 
mark datasets. These areas of focus will drive 
advancements and ensure the effectiveness of face 
mask detection systems in various practical applica- 
tions. 


The findings of this study have concluded with 
the effectiveness and accuracy of the presented mask 
detection mechanism. The trained CNN model 
achieved an impressive overall accuracy of 95% in 
detecting masks, with high precision and recall val- 
ues. The model showcased robust performance even 
in complex situations, such as multiple people at dif- 
ferent distances, highlighting its potential for real- 
time mask monitoring in public places. The signif- 
icance of the proposed mask detection mechanism 
in public places cannot be overstated. By provid- 
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ing an automated and reliable solution for identify- 
ing mask-wearing individuals, the system can con- 
tribute to the enforcement of mask-wearing proto- 
cols, thus reducing the risk of virus transmission. 
The implementation of this mechanism in various 
public settings, including offices, airports, and shop- 
ping malls, can play a crucial role in maintaining 
public health and safety. 

Moreover, this research opens up avenues for 
future research and practical applications. Fur- 
ther investigations can explore the extension of 
the model to detect other aspects of mask compli- 
ance, such as improper mask usage or the iden- 
tification of different types of masks. Addition- 
ally, the integration of the proposed mechanism 
into existing surveillance systems or security frame- 
works can enhance public health monitoring capa- 
bilities. Practical applications of the mask detec- 
tion mechanism can be diverse and widespread. 
It can be integrated into CCTV systems in pub- 
lic places, enabling real-time monitoring and alert- 
ing authorities or security personnel in case of non- 
compliance. The mechanism can also be deployed 
in mobile applications, allowing individuals to self- 
assess their mask-wearing habits and receive notifi- 
cations or reminders for adherence. 
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